(54) Thermostable DNA polymerase from Carboxydothermus hydrogenoformans

(57) A DNA polymerase from a thermophilic eubacterium is provided. The DNA polymerase
shows magnesium ion dependent reverse transcriptase activity and 3'-5' exonuclease
activity. The invention also includes recombinant plasmids and transformed host cells
capable of producing the enzyme. The enzyme is classified into class EC 2.7.7.7, a DNA
nucleotidyltransferase of the DNA-directed type.
Description

The present invention relates to a thermostable enzyme which is a DNA polymerase obtainable from
Carboxydothermus hydrogenoformans.

Furthermore, the present invention relates to the field of molecular biology and provides improved methods for the
replication and amplification of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) sequences. In a preferred
embodiment, the invention provides a method for synthesizing a complementary DNA copy from an RNA template with a
thermoactive DNA polymerase. In another aspect, the invention provides methods for amplifying a DNA segment from an
RNA or DNA template using a thermostable DNA polymerase (RT-PCR or PCR).

Heat stable DNA polymerases (EC 2.7.7.7, DNA nucleotidyltransferase, DNA-directed) have been isolated from
numerous thermophilic organisms (for example: Kaledin et al. (1980) Biokhimiya 44, 644-651; Kaledin et al. (1981)
Biokhimiya 46, 1247-1254; Kaledin et al. (1982) Biokhimiya 47, 1515-1521; Ruttimann et al. (1985) Eur. J. Biochem. 149,
41-46; Neuner et al. (1990) Arch. Microbiol. 153, 205-207).

For some organisms, the polymerase gene has been cloned and expressed (Lawyer et al. (1989) J. Biol. Chem. 264,
6427-6437; Engelke et al. (1990) Anal. Biochem. 191, 396-400; Lundberg et al. (1991) Gene 108, 1-6; Perler et al.
(1992) Proc. Natl. Acad. Sci. USA 89, 5577).

Thermophilic DNA polymerases are increasingly becoming important tools for use in molecular biology and there
is growing interest in finding new polymerases which have more suitable properties and activities for use in diagnostic
detection of RNA and DNA, gene cloning and DNA sequencing. At present, the thermophilic DNA polymerases mostly
used for these purposes are from Thermus species, like Taq polymerase from T. aquaticus (Brock et al. (1969)
J. Bacteriol. 98, 289-297).

Reverse transcription is commonly performed with viral reverse transcriptases like the enzymes isolated from Avian
myeloblastosis virus or Moloney murine leukemia virus, which are active in the presence of magnesium ions but have
the disadvantages that they possess RNase H activity, which destroys the template RNA during the reverse transcription
reaction, and that they have a temperature optimum of 42°C or 37°C, respectively.

Alternative methods are described using the reverse transcriptase activity of DNA polymerases of thermophilic
organisms which are active at higher temperatures. Reverse transcription at higher temperatures is of advantage to
overcome secondary structures of the RNA template which could result in premature termination of products.
Thermostable DNA polymerases with reverse transcriptase activities are commonly isolated from Thermus species. These
DNA polymerases, however, show reverse transcriptase activity only in the presence of manganese ions. These reaction
conditions are suboptimal, because in the presence of manganese ions the polymerase copies the template RNA
with low fidelity.

Another feature of the commonly used reverse transcriptases is that they do not contain 3'-5' exonuclease activity.
Therefore, misincorporated nucleotides cannot be removed and thus the cDNA copies from the template RNA may
contain a significant degree of mutations.

Therefore, it is desirable to develop a reverse transcriptase

• which acts at higher temperatures to overcome secondary structures in the template, to
avoid premature termination of the reaction and to assure the production of cDNA without deletions,

• which is active in the presence of magnesium ions in order to prepare cDNA from RNA templates with higher fidelity,
and

• which has 3'-5' exonuclease activity in order to remove misincorporated nucleotides before continuation of DNA
synthesis and to produce a product with a low mutation frequency.

The present invention addresses these needs and provides a heat stable DNA polymerase active at higher
temperatures which has reverse transcriptase activity in the presence of magnesium ions and which has 3'-5' exonuclease
activity.

It is an object of this invention to provide a polymerase enzyme (EC 2.7.7.7), characterised in that it has reverse
transcriptase activity in the presence of magnesium ions as well as in the presence of manganese ions. In another
aspect the invention comprises a DNA polymerase isolated from Carboxydothermus hydrogenoformans (Deutsche
Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg 1b, D-38124 Braunschweig, DSM No.
8979). In a further aspect the invention comprises a DNA polymerase having reverse transcriptase activity in the
presence of magnesium ions and in the substantial absence of manganese ions. In a further aspect the invention
comprises a DNA polymerase having a molecular mass of about 100 to 105 kDa as determined by in situ PAGE analysis.
In a further aspect the invention comprises a reverse transcriptase which is thermostable. In a further aspect the
invention comprises a DNA polymerase having 3'-5' exonuclease activity. In a further aspect the invention comprises a
recombinant DNA sequence that encodes DNA polymerase activity of the microorganism Carboxydothermus
hydrogenoformans. In a related aspect, the DNA sequence is depicted as SEQ ID No. 7. In a second related aspect the
invention comprises a recombinant DNA sequence that encodes essentially amino acid residues 1 to 831. In a further
aspect the invention comprises a recombinant DNA plasmid that comprises the DNA sequence of the invention inserted
into plasmid vectors and which can be used to drive the expression of the thermostable DNA polymerase of
Carboxydothermus hydrogenoformans in a host cell transformed with the plasmid. In a further aspect the invention
includes a recombinant strain comprising the vector pDS56 carrying the Carboxydothermus hydrogenoformans DNA
polymerase gene, the resulting plasmid being designated pAR4. The E. coli strain (BL21 (DE3) pUBS520) carrying the
plasmid pAR4 was deposited with the Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (Mascheroder
Weg 1b, D-38124 Braunschweig; DSM No. 11179) and is designated AR96.

In referring to a peptide chain as being comprised of a series of amino acids "substantially or effectively" in
accordance with a list offering no alternatives within itself, we include within that reference any versions of the
peptide chain bearing substitutions made to one or more amino acids in such a way that the overall structure and the
overall function of the protein composed of that peptide chain is substantially the same as - or undetectably different
to - that of the unsubstituted version. For example it is generally possible to exchange alanine and valine without
greatly changing the properties of the protein, especially if the changed site or sites are at positions not critical to
the morphology of the folded protein.

The DNA polymerase is "thermostable", meaning that it is stable to heat and preferentially active at higher
temperatures, especially the high temperatures used for denaturation of DNA strands. More particularly, the thermostable
DNA polymerases are not substantially inactivated at the high temperatures used in polymerase chain reactions.

The term "reverse transcriptase" describes a class of polymerases characterized as RNA-dependent DNA
polymerases. All known reverse transcriptases require a primer to synthesize a DNA transcript from an RNA template.
Historically, reverse transcriptase has been used primarily to transcribe mRNA into cDNA which can then be cloned into
a vector for further manipulation.

Other definitions are used in a manner consistent with the art.

Carboxydothermus hydrogenoformans was isolated from a hot spring in Kamchatka by V. Svetlichny. A sample of
C. hydrogenoformans was deposited with the Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM)
under the terms of the Budapest Treaty and received Accession Number DSM 8979. The thermostable polymerase
isolated from Carboxydothermus hydrogenoformans has a molecular weight of 100 to 105 kDa and retains more than
60 % of its initial activity after heating to 95°C for 30 minutes. The thermostable enzyme possesses a 5'-3' polymerase
activity, a 3'-5' exonuclease activity and a reverse transcriptase activity which is Mg2+-dependent. The thermostable
enzyme may be native or recombinant and may be used for first- and second-strand cDNA synthesis, in cDNA cloning,
DNA sequencing, DNA labeling and DNA amplification.

For recovering the native protein C. hydrogenoformans may be grown using any suitable technique, such as the
technique described by Svetlichny et al. (1991) System. Appl. Microbiol. 14, 205-208. After cell growth one preferred
method for isolation and purification of the enzyme is accomplished using the multi-step process as follows:

The cells are thawed, suspended in buffer A (40 mM Tris-HCl, pH 7.5, 0.1 mM EDTA, 7 mM 2-mercaptoethanol, 0.4
M NaCl, 10 mM Pefabloc) and lysed by twofold passage through a Gaulin homogenizer. The raw extract is cleared by
centrifugation, the supernatant dialyzed against buffer B (40 mM Tris-HCl, pH 7.5, 0.1 mM EDTA, 7 mM
2-mercaptoethanol, 10 % glycerol) and brought onto a column filled with Heparin-Sepharose (Pharmacia). In each case
the columns are equilibrated with the starting solvent and, after the application of the sample, washed with three
times their volume of this solvent. Elution of the first column is performed with a linear gradient of 0 to 0.5 M NaCl
in buffer B. The fractions showing polymerase activity are pooled and ammonium sulfate is added to a final
concentration of 20 %. This solution is applied to a hydrophobic column containing Butyl-TSK-Toyopearl (TosoHaas).
This time the column is eluted with a falling gradient of 20 to 0 % ammonium sulfate. The pool containing the activity
is dialyzed and again transferred to a column, this time with DEAE-Sepharose (Pharmacia), and eluted with a linear
gradient of 0 to 0.5 M NaCl in buffer B. The fourth column contains Tris-Acryl-Blue (Biosepra) and is eluted as in the
preceding case. Finally the active fractions are dialyzed against buffer C (20 mM Tris-HCl, pH 7.5, 0.1 mM EDTA,
7.0 mM 2-mercaptoethanol, 100 mM NaCl, 50 % glycerol).

Isolation of recombinant DNA polymerase from Carboxydothermus hydrogenoformans may be performed with the 
same protocol or with other commonly used procedures. 

DNA polymerase activity was measured by incorporation of digoxigenin-labeled dUTP into the synthesized DNA and
detection and quantification of the incorporated digoxigenin essentially according to the method described in Höltke,
H.-J.; Sagner, G.; Kessler, C. and Schmitz, G. (1992) Biotechniques 12, 104-113.

Determination of reverse transcriptase activity is performed essentially as described for determination of DNA
polymerase activity except that the reaction mixture consists of the components as described in Example 3.

In situ PAGE analysis of polymerase activity and reverse transcriptase activity was performed essentially according to
the method described by Spanos A. and Hübscher U. ((1983) Methods in Enzymology 91, 263-277). Some minor but
essential modifications to the original method are that the renaturation of the SDS-denatured polypeptides is
performed in the presence of magnesium ions (3 mM) and dATP (0.5-1 µM) to assist refolding.
3'-5' exonuclease activity is commonly referred to as the "proofreading" or "editing" activity of DNA polymerases. It is
located in the small domain of the large fragment of Type A polymerases. This activity removes mispaired nucleotides
from the 3' end of the primer terminus of DNA in the absence of nucleoside triphosphates (Kornberg A. and Baker
T.A. (1992) DNA Replication, W. H. Freeman & Company, New York). This nuclease action is suppressed by
deoxynucleoside triphosphates if they match the template and can be incorporated into the polymer.

The 3'-5' exonuclease activity of the claimed DNA polymerase can be measured as degradation or shortening of
a 5'-digoxigenin-labeled oligonucleotide annealed to template DNA in the absence or presence of deoxyribonucleoside
triphosphates or on DNA fragments in the absence or presence of deoxyribonucleoside triphosphates.

Carboxydothermus hydrogenoformans DNA polymerase is the first DNA polymerase isolated from thermophilic
eubacteria with a higher activity in the presence of magnesium ions than in the presence of manganese ions, as shown
in figure 1. Compared to the DNA polymerase activity, the reverse transcriptase activity in the presence of magnesium
is relatively high. This is shown in figure 6 in comparison with the DNA polymerases from T. filiformis,
A. thermophilum and T. thermophilus, the DNA polymerase most commonly used for reverse transcription. The
magnesium-dependent reverse transcriptase activity is of advantage since DNA polymerases synthesize DNA with higher
fidelity in the presence of magnesium than in the presence of manganese (Beckmann R. A. et al. (1985) Biochemistry 24,
5810-5817; Ricchetti M. and Buc H. (1993) EMBO J. 12, 387-396). Low fidelity DNA synthesis is likely to lead to mutated
copies of the original template. In addition, Mn2+ ions have been implicated in an increased rate of RNA degradation,
particularly at higher temperatures, and this can cause the synthesis of shortened products in the reverse
transcription reaction.

The DNA sequence (SEQ ID No.: 7) of Carboxydothermus hydrogenoformans polymerase and the derived amino
acid sequence of the enzyme are shown in figure 5. The molecular weight deduced from the sequence is 94 348 Da.
In SDS polyacrylamide gel electrophoresis, however, the Carboxydothermus hydrogenoformans polymerase has an
electrophoretic mobility higher than E. coli pol I (109 kDa) and a lower mobility than Taq polymerase (94 kDa) and
Klenow fragment (76 kDa), as shown in figure 2. Comparing the migration properties of Taq and E. coli DNA polymerases
with those of Carboxydothermus hydrogenoformans polymerase, a molecular weight of 100 to 105 kDa can be deduced.
Since the Carboxydothermus hydrogenoformans polymerase isolated from the native strain has the same migration
properties as the recombinant enzyme, the "slower" migration during SDS gel electrophoresis must be a property
of the enzyme rather than a cloning artefact. A possible explanation for this phenomenon could be that this enzyme,
which is derived from a thermophilic organism, has a very stable structure which is not completely unfolded under the
standard denaturation conditions used.
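
The size of the discrepancy between the sequence-derived mass and the apparent gel mass can be put into numbers. The following is only an illustrative sketch: the two mass values are taken from the paragraph above, while the function name `percent_excess` is a hypothetical helper introduced here.

```python
# Sequence-derived mass (94 348 Da) and apparent SDS-PAGE range (100 to 105 kDa),
# both as stated in the text; everything else is illustrative.
CALCULATED_KDA = 94.348
APPARENT_KDA_RANGE = (100.0, 105.0)


def percent_excess(apparent_kda: float, calculated_kda: float) -> float:
    """Return by how many percent the apparent mass exceeds the calculated mass."""
    return (apparent_kda - calculated_kda) / calculated_kda * 100.0


low, high = (percent_excess(mass, CALCULATED_KDA) for mass in APPARENT_KDA_RANGE)
print(f"apparent mass exceeds calculated mass by {low:.1f} % to {high:.1f} %")
```

Under these figures the apparent mass is roughly 6 to 11 % above the calculated one.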

The production of a recombinant form of Carboxydothermus hydrogenoformans DNA polymerase generally
includes the following steps: chromosomal DNA from Carboxydothermus hydrogenoformans is isolated by treating the
cells with detergent, e.g. SDS, and a proteinase, e.g. Proteinase K. The solution is extracted with phenol and chloroform
and the DNA purified by precipitation with ethanol. The DNA is dissolved in Tris/EDTA buffer and the gene encoding the
DNA polymerase is specifically amplified by the PCR technique using two mixed oligonucleotides (primer 1 and 2).
These oligonucleotides, described by SEQ ID No.: 1 and SEQ ID No.: 2, were designed on the basis of conserved
regions of family A DNA polymerases as published by Braithwaite D. K. and Ito J., 1993, Nucl. Acids Res. Vol. 21, p.
787-802. The specifically amplified fragment is ligated into a vector, preferably the pCR™II vector (Invitrogen), and the
sequence is determined by cycle-sequencing. Complete isolation of the coding region and the flanking sequences of
the DNA polymerase gene can be performed by restriction fragmentation of the Carboxydothermus hydrogenoformans
DNA with a restriction enzyme different from that used in the first round of screening and by inverse PCR (Innis et
al. (1990) PCR Protocols; Academic Press, Inc., p. 219-227). This can be accomplished with synthesized oligonucleotide
primers binding at the outer DNA sequences of the gene part but in opposite orientation. These oligonucleotides,
described by SEQ ID Nos. 3 and 4, were designed on the basis of the sequences which were determined by sequencing of
the first PCR product described above. As template, Carboxydothermus hydrogenoformans DNA is used which is cleaved
by restriction digestion and circularized by contacting with T4 DNA ligase. To isolate the coding region of the whole
polymerase gene, another PCR is performed using primers as shown in SEQ ID Nos. 5 and 6 to amplify the complete DNA
polymerase gene directly from genomic DNA and to introduce ends compatible with the linearized expression vector.

SEQ ID No. 1:

Primer 1: 5'-CCN AAY YTN CAR AAY ATH-3'

SEQ ID No. 2:

Primer 2: 5'-YTC RTC RTG NAC YTG-3'

SEQ ID No. 3:

Primer 3: 5'-GGG CGA AGA CGC TAT ATT CCT GAG C-3'

SEQ ID No. 4:

Primer 4: 5'-GAA GCC TTA ATT CAA TCT GGG AAT AAT C-3'

SEQ ID No. 5:

Primer 5: 5'-CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT-3'

SEQ ID No. 6:

Primer 6: 5'-CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT-3'
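
The "mixed" nature of primers 1 and 2 and the cloning ends carried by primers 5 and 6 can be inspected with a short script. This is a sketch under stated assumptions, not part of the described procedure: the primer sequences are copied from the list above, the IUPAC table holds the standard ambiguity codes, and the three recognition sequences are the standard ones for EcoRI, NcoI and BamHI. The text itself does not name these enzymes; it only states that primers 5 and 6 introduce ends compatible with the expression vector. The helper names are hypothetical.

```python
from math import prod

# Standard IUPAC nucleotide ambiguity codes (only those occurring in the primers).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "H": "ACT", "N": "ACGT",
}

# Primer sequences as listed under SEQ ID Nos. 1 to 6.
PRIMERS = {
    1: "CCN AAY YTN CAR AAY ATH",
    2: "YTC RTC RTG NAC YTG",
    3: "GGG CGA AGA CGC TAT ATT CCT GAG C",
    4: "GAA GCC TTA ATT CAA TCT GGG AAT AAT C",
    5: "CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT",
    6: "CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT",
}

# Assumption: well-known recognition sequences, not named in the text.
SITES = {"EcoRI": "GAATTC", "NcoI": "CCATGG", "BamHI": "GGATCC"}


def clean(seq: str) -> str:
    """Remove the triplet spacing and normalise to upper case."""
    return seq.replace(" ", "").upper()


def degeneracy(seq: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in clean(seq))


def sites_in(seq: str) -> list[str]:
    """Names of the listed recognition sequences found literally in the primer."""
    plain = clean(seq)
    return sorted(name for name, site in SITES.items() if site in plain)


for number, seq in PRIMERS.items():
    print(f"primer {number}: length {len(clean(seq))}, "
          f"degeneracy {degeneracy(seq)}, sites {sites_in(seq)}")
```

With these definitions primer 1 represents 768 and primer 2 represents 64 individual sequences, while primers 3 to 6 are unique. Primer 5 contains the GAATTC and CCATGG motifs and primer 6 contains GAATTC and GGATCC.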

The gene is operably linked to appropriate control sequences for expression in either prokaryotic or eucaryotic
host/vector systems. The vector preferably encodes all functions required for transformation and maintenance in a
suitable host, and may encode selectable markers and/or control sequences for polymerase expression. Active
recombinant thermostable polymerase can be produced by transformed host cultures either continuously or after
induction of expression. Active thermostable polymerase can be recovered either from host cells or from the culture
media if the protein is secreted through the cell membrane.

It is also preferable that Carboxydothermus hydrogenoformans thermostable polymerase expression is tightly
controlled in E. coli during cloning and expression. Vectors useful in practicing the present invention should provide
varying degrees of controlled expression of Carboxydothermus hydrogenoformans polymerase by providing some or all of
the following control features: (1) promoters or sites of initiation of transcription, either directly adjacent to the
start of the polymerase gene or as fusion proteins, (2) operators which could be used to turn gene expression on or
off, (3) ribosome binding sites for improved translation, and (4) transcription or translation termination sites for
improved stability. Appropriate vectors used in cloning and expression of Carboxydothermus hydrogenoformans polymerase
include, for example, phage and plasmids. Examples of phage include lambda gt11 (Promega), lambda Dash (Stratagene)
and lambda ZapII (Stratagene). Examples of plasmids include pBR322, pBTac2 (Boehringer Mannheim), pBluescript
(Stratagene), pSP73 (Promega), pET3A (Rosenberg, supra, (1987) Gene 56:125-135), pASK75 (Biometra), pDS56 (Stüber, D.,
Matile, H. and Garotta, G. (1990) Immunological Methods, Lefkovits, I. and Pernis, B., eds.) and pET11C (Studier, F. W.
(1990) Methods in Enzymology 185:60-89). According to the present invention the use of a plasmid, particularly pDS56,
has been shown to be advantageous. The plasmid pDS56 carrying the Carboxydothermus hydrogenoformans DNA
polymerase gene is then designated pAR4.

Standard protocols exist for transformation, phage infection and cell culture (Maniatis et al. (1982) Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press). Of the numerous E. coli strains which can be used
for plasmid transformation, the preferred strains include JM110 (ATCC 47013), LE392 pUBS520 (Maniatis et al., supra;
Brinkmann et al. (1989) Gene 85:109-114), JM101 (ATCC No. 33876), XL1 (Stratagene), RR1 (ATCC No. 31343),
BL21 (DE3) pUBS520 (Brinkmann, U. et al. (1989) Gene 85, 109-114) and BL21 (DE3) pLysS (Studier, F. W. et al.
(1990) Methods in Enzymology, supra). According to the present invention the use of the E. coli strain BL21 (DE3)
pUBS520 has been shown to be advantageous. The E. coli strain BL21 (DE3) pUBS520 transformed with the plasmid pAR4
is then designated AR96 (DSM No. 11179). E. coli strains XL1-Blue (Stratagene) and ER1458 (Raleigh, E. A. et al.
(1988) Nucleic Acids Research 16:1563-1575) are among the strains that can be used for lambda phage, and Y1089
can be used for lambda gt11 lysogeny. The transformed cells are preferably grown at 37°C and expression of the cloned
gene is induced with IPTG.

Isolation of the recombinant DNA polymerase can be performed by standard techniques. Separation and purification
of the DNA polymerase from the E. coli extract can be performed by standard methods. These methods include, for
example, methods utilizing solubility such as salt precipitation and solvent precipitation, methods utilizing the
difference in molecular weight such as dialysis, ultra-filtration, gel-filtration, and SDS-polyacrylamide gel
electrophoresis, methods utilizing a difference in electric charge such as ion-exchange column chromatography, methods
utilizing specific interaction such as affinity chromatography, methods utilizing a difference in hydrophobicity such
as reversed-phase high performance liquid chromatography and methods utilizing a difference in isoelectric point such
as isoelectric focussing electrophoresis.

The present invention provides improved methods for efficiently transcribing RNA and amplifying RNA or DNA.
These improvements are achieved by the discovery and application of previously unknown properties of thermoactive
DNA polymerases.

The thermostable enzyme of this invention may be used for any purpose in which such enzyme activity is necessary
or desired. In a particularly preferred embodiment, the enzyme catalyzes the nucleic acid amplification reaction
known as PCR. This process for amplifying nucleic acid sequences is disclosed and claimed in U.S. Patent No.
4,683,202. The PCR nucleic acid amplification method involves amplifying at least one specific nucleic acid sequence
contained in a nucleic acid or a mixture of nucleic acids and produces double-stranded DNA. Any nucleic acid
sequence, in purified or nonpurified form, can be utilized as the starting nucleic acid(s), provided it contains or is
suspected to contain the specific nucleic acid sequence desired. The nucleic acid to be amplified can be obtained from
any source, for example, from plasmids such as pBR322, from cloned DNA or RNA, from natural DNA or RNA from any
source, including bacteria, yeast, viruses, organelles, and higher organisms such as plants and animals, or from
preparations of nucleic acids made in vitro.

EP 0 834 569 A1

DNA or RNA may be extracted from blood, tissue material such as chorionic villi, or amniotic cells by a variety of
techniques. See, e.g., Maniatis et al., 1982, Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory,
Cold Spring Harbor, New York) pp. 280-281. Thus the process may employ, for example, DNA or RNA, including
messenger RNA, which DNA or RNA may be single-stranded or double-stranded. In addition, a DNA-RNA hybrid which
contains one strand of each may be utilized.

The amplification of target sequences in DNA or from RNA may be performed to prove the presence of a particular
sequence in the sample of nucleic acid to be analyzed or to clone a specific gene. DNA polymerase from
Carboxydothermus hydrogenoformans is very useful for these processes. Due to its 3'-5' exonuclease activity it is able
to synthesize products with higher accuracy than the reverse transcriptases of the state of the art.

DNA polymerase from Carboxydothermus hydrogenoformans may also be used to simplify and improve methods
for detection of RNA target molecules in a sample. In these methods DNA polymerase from Carboxydothermus
hydrogenoformans may catalyze: (a) reverse transcription, (b) second strand cDNA synthesis, and, if desired,
(c) amplification by PCR. The use of DNA polymerase from Carboxydothermus hydrogenoformans in the described methods
would eliminate the previous requirement of two sets of incubation conditions which are necessary due to the use of
different enzymes for each step. DNA polymerase from Carboxydothermus hydrogenoformans may be used to perform RNA
reverse transcription and amplification of the resulting complementary DNA with enhanced specificity and
with fewer steps than previous RNA cloning and diagnostic methods.

Brief description of the drawings 

Fig. 1 shows the relative reverse transcriptase activity of DNA polymerase from Carboxydothermus
hydrogenoformans in dependence of magnesium and manganese salt.

Fig. 2 shows a photograph of a reverse transcriptase assay performed in situ. The reverse transcriptase activity of
DNA polymerase from Carboxydothermus hydrogenoformans and reference polymerases is detected in situ. A fraction
of DNA polymerase from C. hydrogenoformans was submitted to electrophoresis on an SDS-polyacrylamide gel
containing polyA/oligodT. After electrophoresis the SDS was removed, the proteins were renatured and incubated at 65°C
in the presence of magnesium salt, dNTPs and digoxigenin-labeled dUTP to allow synthesis of the complementary strand.
The nucleic acid was blotted to a nylon membrane and the newly synthesized DNA detected by a chemiluminescence
reaction.

Fig. 3 shows the thermostability of DNA polymerase from Carboxydothermus hydrogenoformans. Aliquots of the
DNA polymerase were incubated for 30 min. at the temperatures indicated in the figure, and subsequently the
remaining enzyme activity was determined.

Fig. 4 shows the analysis for 3'-5' exonuclease activity of DNA polymerase from Carboxydothermus
hydrogenoformans in comparison with DNA polymerase from Thermus aquaticus and Pyrococcus woesei. The analysis is
performed in the presence or absence of dNTPs. A 22mer primer labeled with digoxigenin at the 5' end was annealed to
a 34mer template DNA leaving a 12 bp 5' overhang of template DNA. DNA polymerases from Carboxydothermus
hydrogenoformans, Thermus aquaticus and Pyrococcus woesei were incubated with this substrate in the presence of
magnesium with or without dNTPs. The products were separated on a sequencing gel, blotted to a nylon membrane
and detected by a chemiluminescence reaction.

Fig. 5 shows the DNA sequence of the polymerase gene of Carboxydothermus hydrogenoformans with SEQ ID
No.: 7 and the derived peptide sequence of the DNA polymerase protein.

Fig. 6 shows a comparison of the reverse transcriptase activity of the thermostable DNA polymerases from
Carboxydothermus hydrogenoformans, Anaerocellum thermophilum, Thermus filiformis (Pacific Enzymes) and Thermus
thermophilus. Similar amounts (units) of the DNA polymerases were analyzed. Each enzyme was tested for DNA polymerase
activity, for reverse transcriptase activity in the presence of Mg2+ (5 mM) and for reverse transcriptase activity in
the presence of Mn2+ (1 mM) under the reaction conditions optimal for the individual enzymes. DNA synthesis was
measured by incorporation of digoxigenin-labeled nucleotides. In order to compare the ratio of DNA polymerase to
reverse transcriptase activity, the relative light units (RLU) measured in the DNA polymerase assay were set to 100.
The RLUs measured in the reverse transcriptase activity tests are expressed as percent of the polymerase activity.
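
The normalization described for Fig. 6 amounts to a simple ratio. The sketch below illustrates it; the function name `relative_activities` and all RLU readings are hypothetical and are not data from the figure.

```python
def relative_activities(pol_rlu: float, rt_rlu: dict[str, float]) -> dict[str, float]:
    """Express reverse transcriptase RLU readings as percent of the DNA polymerase RLU.

    The DNA polymerase reading is set to 100, as described for Fig. 6.
    """
    if pol_rlu <= 0:
        raise ValueError("DNA polymerase RLU must be positive")
    return {condition: rlu * 100.0 / pol_rlu for condition, rlu in rt_rlu.items()}


# Hypothetical readings for one enzyme (illustration only).
example = relative_activities(
    pol_rlu=20000.0,
    rt_rlu={"Mg2+ (5 mM)": 9000.0, "Mn2+ (1 mM)": 3000.0},
)
for condition, percent in example.items():
    print(f"RT activity with {condition}: {percent:.1f} % of DNA polymerase activity")
```

Expressing each enzyme relative to its own polymerase reading makes enzymes with different absolute signal levels comparable.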
Example 1

Detection of endonuclease, exonuclease and ribonuclease activities: 

Absence of endonuclease activity: 1 µg of plasmid DNA is incubated for 4 hours with an excess of purified DNA
polymerase in 50 µl of test buffer with a paraffin oil overlay at 72°C.

Absence of nonspecific exonuclease activity: 1 µg of EcoRI/HindIII fragments of lambda DNA is incubated in 100 µl
of test buffer in the absence and presence of dNTPs (1 mM final concentration each) with an excess of purified DNA
polymerase for 4 hours at 72°C.

Absence of ribonuclease activity: 3 µg of MS2 RNA are incubated with an excess of DNA polymerase in 20 µl of test
buffer for 4 hours at 72°C. The RNA is subsequently analyzed by electrophoresis in a MOPS gel (Maniatis et al. (1982)
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York).

Example 2 

Determination of DNA polymerase activity 

DNA polymerase activity was measured by incorporation of digoxigenin-labeled dUTP into the synthesized DNA and
detection and quantification of the incorporated digoxigenin essentially according to the method described in Höltke,
H.-J.; Sagner, G.; Kessler, C. and Schmitz, G. (1992) Biotechniques 12, 104-113. The reaction is performed in a reaction
volume of 50 µl containing 1 or 2 µl of diluted (0.05 U - 0.01 U) DNA polymerase and 50 mM Tris-HCl, pH 8.5; 12.5 mM
(NH4)2SO4; 10 mM KCl; 5 mM MgCl2; 10 mM 2-mercaptoethanol; 33 µM dNTPs; 200 µg/ml BSA; 12 µg of DNase I-activated
DNA from calf thymus and 0.036 µM digoxigenin-dUTP.
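
When the reaction mixture is assembled from concentrated stocks, the volume of each stock follows from the dilution relation (final concentration / stock concentration) x reaction volume. The sketch below applies this to several components of the 50 µl reaction. The final concentrations are those given above; all stock concentrations are assumptions chosen only for illustration, and the helper name `stock_volume_ul` is hypothetical.

```python
REACTION_VOLUME_UL = 50.0

# name: (final concentration from the text, ASSUMED stock concentration).
# Both values of a pair use the same unit, given in the name.
COMPONENTS = {
    "Tris-HCl pH 8.5 (mM)": (50.0, 1000.0),
    "(NH4)2SO4 (mM)": (12.5, 1000.0),
    "KCl (mM)": (10.0, 1000.0),
    "MgCl2 (mM)": (5.0, 100.0),
    "2-mercaptoethanol (mM)": (10.0, 1000.0),
    "dNTPs (uM)": (33.0, 1000.0),
    "BSA (ug/ml)": (200.0, 10000.0),
}


def stock_volume_ul(final: float, stock: float, total_ul: float) -> float:
    """Volume of stock needed so that the component reaches `final` in `total_ul`."""
    if stock <= final:
        raise ValueError("stock must be more concentrated than the final mixture")
    return final / stock * total_ul


volumes = {
    name: stock_volume_ul(final, stock, REACTION_VOLUME_UL)
    for name, (final, stock) in COMPONENTS.items()
}
for name, volume in volumes.items():
    print(f"{name}: {volume:.3f} ul of stock")
```

The template DNA, the labeled nucleotide, the enzyme dilution and water make up the remaining volume; they are left out here because their stock concentrations are not stated.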

The samples are incubated for 30 min. at 72°C, the reaction is stopped by addition of 2 µl 0.5 M EDTA, and the
tubes placed on ice. After addition of 8 µl 5 M NaCl and 150 µl of ethanol (precooled to -20°C) the DNA is precipitated
by incubation for 15 min. on ice and pelleted by centrifugation for 10 min. at 13,000 rpm and 4°C. The pellet is washed
with 100 µl of 70 % ethanol (precooled to -20°C) and 0.2 M NaCl, centrifuged again and dried under vacuum.

The pellets are dissolved in 50 µl Tris-EDTA (10 mM/0.1 mM; pH 7.5). 5 µl of the sample are spotted into a well of
a nylon membrane bottomed white microwell plate (Pall Filtrationstechnik GmbH, Dreieich, FRG; product no.
SM045BWP). The DNA is fixed to the membrane by baking for 10 min. at 70°C. The DNA loaded wells are filled with
100 µl of 0.45 µm-filtered 1 % blocking solution (100 mM maleic acid, 150 mM NaCl, 1 % (w/v) casein, pH 7.5). All
following incubation steps are done at room temperature. After incubation for 2 min. the solution is sucked through the
membrane with a suitable vacuum manifold at -0.4 bar. After repeating the washing step, the wells are filled with 100
µl of a 1:10,000 dilution of Anti-digoxigenin-AP, Fab fragments (Boehringer Mannheim, FRG, no. 1093274) diluted in the
above blocking solution. After incubation for 2 min. and sucking, this step is repeated once. The wells are washed twice
under vacuum with 200 µl each time of washing-buffer 1 (100 mM maleic acid, 150 mM NaCl, 0.3 % (v/v) Tween™ 20
(Poly(oxyethylen)n-Sorbitan-monolaurat), pH 7.5). After washing another two times under vacuum with 200 µl each time
of washing-buffer 2 (10 mM Tris-HCl, 100 mM NaCl, 50 mM MgCl2, pH 9.5) the wells are incubated for 5 min. with 50 µl
of CSPD™ (Boehringer Mannheim, no. 1655884), diluted 1:100 in washing-buffer 2, which serves as a chemiluminescent
substrate for the alkaline phosphatase. The solution is sucked through the membrane and after 10 min. incubation
the RLU/s (Relative Light Unit per second) are detected in a luminometer, e.g. MicroLumat LB 96 P (EG&G Berthold,
Wildbad, FRG).

With a serial dilution of Taq DNA polymerase a reference curve is prepared from which the linear range serves as 
a standard for the activity determination of the DNA polymerase to be analyzed. 
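
The reference-curve step can be illustrated with a short sketch. It assumes that the linear range of the Taq dilution series is fitted by ordinary least squares and that unknown samples are read off by inverting the fitted line; the function names and all numeric readings are hypothetical placeholders, not values from this document.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def units_from_rlu(rlu, slope, intercept, rlu_min, rlu_max):
    """Convert a luminometer reading (RLU/s) to polymerase units.

    Readings outside the calibrated linear range are rejected
    instead of being extrapolated.
    """
    if not rlu_min <= rlu <= rlu_max:
        raise ValueError("reading outside the linear range of the reference curve")
    return (rlu - intercept) / slope


# Hypothetical Taq dilution series (units per reaction, RLU/s); placeholders only.
taq_units = [0.01, 0.02, 0.04, 0.08]
taq_rlu = [1000.0, 2000.0, 4000.0, 8000.0]

slope, intercept = fit_line(taq_units, taq_rlu)
sample_units = units_from_rlu(3000.0, slope, intercept, min(taq_rlu), max(taq_rlu))
print(round(sample_units, 4))
```

Rejecting out-of-range readings reflects the statement that only the linear range of the curve serves as the standard; such samples would be diluted and measured again.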

Example 3 

Determination of reverse transcriptase activity 

The assay is performed essentially as described for determination of DNA polymerase activity except that the reaction
mixture consists of the following components: 1 µg of polydA-(dT)15, 33 µM of dTTP, 0.36 µM of digoxigenin-dUTP,
200 µg/ml BSA, 10 mM Tris-HCl, pH 8.5, 20 mM KCl, 5 mM MgCl2, 10 mM DTE and various amounts of DNA polymerase.
The incubation temperature used is 50°C.

Example 4 

Detection of DNA polymerase activity in situ 

In situ PAGE analysis of polymerase activity and reverse transcriptase activity was performed essentially according
to the method described by Spanos A. and Hübscher U. (1983) Methods in Enzymology 91, 263-277. Some minor but
essential modifications to the original method are that the renaturation of the SDS-denatured polypeptides is
performed in the presence of magnesium ions (3 mM) and dATP (0.5-1 µM) to assist refolding.




In brief the method is as follows:

After separation of the polypeptides from either crude cell extracts or purified samples on denaturing 8 %
polyacrylamide gels (stacking gel 5 % acrylamide) which contain 150 µg activated calf thymus DNA per ml gel volume, the gels
are washed four times for 30 min each at room temperature with moderate shaking in excess renaturation buffer
(Tris-HCl, 50 mM, pH 8.3; EDTA, 1 mM; 2-mercaptoethanol, 3 mM; KCl, 50 mM; glycerol, 5-10 %) to remove SDS. Then the
gels are incubated overnight in the same buffer, including 3 mM MgCl2 and 0.5-1 µM dATP, at 4°C without agitation. The
first four washes are repeated the next day with renaturation buffer.

Subsequent to the removal of SDS and renaturation of the proteins the gel is transferred into reaction mixture,
consisting of Tris-HCl, 50 mM, pH 8.3; KCl, 50 mM; DTT, 3 mM; MgCl2, 7 mM; 12 µM of each dATP, dCTP, dGTP; 8 µM dTTP
and 4 µM Dig-dUTP; 10 % (vol/vol) glycerol. Gels were first incubated under shaking at room temperature for one hour
and then warmed up stepwise to 37°C, 45°C, 55°C, 65°C and 72°C. At each incubation temperature DNA synthesis was
allowed to proceed for 60 minutes.

After DNA synthesis, the DNA is transferred either by contact blotting or by capillary blotting (15 x SSC, Maniatis et
al., supra) to nylon membranes (Boehringer Mannheim GmbH) and crosslinked.

The detection of newly synthesized, digoxigenin-labeled DNA followed the procedure given in the previous section
(Determination of DNA polymerase activity).

For molecular weight determination marker polymerases of known molecular weight (e.g. Klenow polymerase, Pol
I, Taq polymerase, Tth polymerase, HIV RT, M-MuLV RT) are applied onto the same gel, but in different lanes.
The molecular weight of the claimed DNA polymerase according to this method is 100 to 105 kDa.

Example 5 

Detection of 3'-5'exonuclease activity 

3'-5' exonuclease activity is commonly referred to as "proofreading" or "editing" activity of DNA polymerases. It is
located in the small domain of the large fragment of Type A polymerases. This activity removes nucleotides from the 3'
end of the primer terminus of DNA in the absence of nucleoside triphosphates (Kornberg A. and Baker T. A. (1992) DNA
Replication, W. H. Freeman & Company, New York). This nuclease action is suppressed by deoxynucleoside
triphosphates if they match the template and can be incorporated into the polymer.

The 3'-5' exonuclease activity of the claimed DNA polymerase can be measured as degradation or shortening of a
5'-digoxigenin-labeled oligonucleotide annealed to template DNA in the absence or presence of deoxyribonucleoside
triphosphates or on DNA fragments in the absence or presence of deoxyribonucleoside triphosphates.

Degradation of digoxigenin labeled oligonucleotide: The reaction mixture is essentially the same as that for
determination of DNA polymerase activity (50 mM Tris-HCl, pH 8.4; 12.5 mM (NH4)2SO4; 10 mM KCl; 5 mM MgCl2; 10 mM
2-mercaptoethanol), except that the dNTP concentration was reduced to 12.5 µM and activated calf thymus DNA was
replaced by 500 fmol primer or template/primer mixture.

The primer sequence is:

SEQ ID NO. 8.:

Dig-GCATGGATCCCACTGCCCAGGG (5' to 3'). This primer is annealed with template molecules of various 12 bp
5' overhangs. DNA polymerase samples of typically 0.1 units are incubated in a total reaction volume of 10
µl for 30 min at 72°C in a Perkin Elmer thermal cycler. Reactions are stopped by adding an equal volume of
formamide-buffer (98 % formamide; 10 mM EDTA; bromphenol blue and xylene cyanol) and denatured by heating for 10
min at 95°C. Samples are quickly chilled on ice and loaded on a 20 % denaturing polyacrylamide/urea sequencing
gel. Electrophoresis is performed at 60°C and 2000 V for 2.5 hours.

After separation the DNA is transferred onto a positively charged nylon membrane (Boehringer Mannheim) by contact
blotting for 30 min. The DNA is crosslinked to the membrane by UV-irradiation with 120 mJoule (Stratalinker,
Stratagene). The membrane is blocked with blocking solution (100 mM maleic acid, 150 mM NaCl, 1 % (w/v) casein, pH
adjusted to 7.5 with 1 M NaOH) at room temperature for at least 30 min. Digoxigenin-labeled primer DNA is detected
with anti-digoxigenin-AP, Fab fragments (Boehringer Mannheim, FRG, no 1093274) diluted 1:10000 in blocking
solution (30 min at room temperature). Excess unbound antibody is removed by washing 3-4 times (10-15 min each step)
with washing buffer (100 mM maleic acid, 150 mM NaCl, 0.3 % (v/v) Tween 20 (poly(oxyethylene)n-sorbitan
monolaurate), pH 7.5). The membrane is transferred into a buffer containing 10 mM Tris-HCl, 100 mM NaCl, pH 9.5, and washed
twice for an additional 10-15 min at room temperature. Finally the membrane is soaked with a 1:1000 diluted solution of
CDP-Star (Boehringer Mannheim). CDP-Star serves as a chemiluminescent substrate for alkaline phosphatase.



Then the membrane is transferred on filter paper (Whatman 3MM) to remove excess fluid, positioned between two
sheets of transparent overhead foils and exposed to X-ray films (Chemiluminescent Detection Film, Boehringer
Mannheim) for 5-10 min. 3'-5' exonuclease activity is detected by degradation or shortening of the primer compared with a
control (no polymerase added). As negative and positive controls DNA polymerases from Thermus aquaticus (no 3' to
5' exonuclease activity) and from Pyrococcus woesei (exhibiting 3' to 5' exonuclease activity) are included.

Degradation of DNA fragments in the presence or absence of deoxynucleoside triphosphates: A series of dilutions
of Chy polymerase was incubated for 2 hours at 70°C with 1 µg of DNA molecular weight marker III (Boehringer
Mannheim) in the presence and absence of dNTPs, 1 mM each, in 50 µl of the following incubation buffer: 50 mM Tris-HCl,
pH 7.8; 10 mM MgCl2; 7 mM 2-mercaptoethanol, with paraffin overlay. The DNA fragments were separated on a 1 %
agarose gel containing ethidium bromide. In the absence of dNTPs a smear of DNA fragments or no DNA could be
detected, while in the presence of dNTPs the DNA fragments remained undegraded.

Example 6 

Cloning of the Carboxydothermus hydrogenoformans DNA polymerase gene. 
Preparation of chromosomal DNA from Carboxydothermus hydrogenoformans. 

0.8 g biomass of Carboxydothermus hydrogenoformans was suspended in 20 ml 1 M KCl and centrifuged. Then
the pellet was resuspended in 4.8 ml SET-buffer (150 mM NaCl, 15 mM EDTA, pH 8.0, 60 mM Tris-HCl, pH 8.0, 50 µg/µl
RNase A), after which 1 ml 20 % SDS and 50 µl of proteinase K (10 mg/ml) were added. The mixture was kept at 37°C
for 45 minutes. After extraction with phenol and chloroform the DNA was precipitated with ethanol and dissolved in H2O.
Thus about 4.1 mg of DNA were obtained.

Amplification of specific DNA by PCR.

For amplification of the gene encoding the DNA polymerase of Carboxydothermus hydrogenoformans by the PCR
technique two mixed oligonucleotides (primer 1 and 2) were designed on the basis of conserved regions of family A
DNA polymerases as published by Braithwaite D.K. and Ito J. (1993) Nucl. Acids Res. 21, 787-802.

SEQ ID No.: 1

Primer 1: 5'-CCN AAY YTN CAR AAY ATH-3'

SEQ ID No.: 2

Primer 2: 5'-YTC RTC RTG NAC YTG-3'
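
Because primers 1 and 2 are written with IUPAC ambiguity codes, each stands for a pool of concrete oligonucleotides. The following minimal sketch counts and enumerates that pool; the helper names `degeneracy` and `expand` are illustrative and do not come from the patent.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.replace(" ", "").upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """Yield every concrete sequence represented by a degenerate primer."""
    pools = [IUPAC[base] for base in primer.replace(" ", "").upper()]
    for combo in product(*pools):
        yield "".join(combo)


PRIMER_1 = "CCN AAY YTN CAR AAY ATH"  # SEQ ID No.: 1
PRIMER_2 = "YTC RTC RTG NAC YTG"      # SEQ ID No.: 2

print(degeneracy(PRIMER_1))  # 768
print(degeneracy(PRIMER_2))  # 64
```

The counts show that each individual oligonucleotide is only a small fraction of the 100 pmoles of primer mixture used per reaction.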

The PCR amplification was performed in 100 µl buffer containing 750 ng of genomic DNA from Carboxydothermus
hydrogenoformans, 10 mM Tris-HCl, pH 8.8, 2.5 mM MgCl2, 50 mM KCl, 200 µM dNTPs, 100 pmoles of each primer
and 2.5 units of Taq polymerase (Boehringer Mannheim GmbH, FRG). The target sequence was amplified by first
denaturing at 95°C for 2 min. followed by 30 cycles of 95°C for 0.5 min., 47°C for 1 min. and 72°C for 2 min. Thermal
cycling was performed in a Perkin Elmer GeneAmp 9600 thermal cycler. Agarose gel electrophoresis showed that a
fragment of approximately 600 base pairs was amplified specifically. This fragment was ligated into the pCR II vector
(Invitrogen) and the sequence determined by cycle-sequencing. The amino acid sequence deduced from this
nucleotide sequence was very similar to that of other known DNA polymerases, so that primer 3 and 4 could be designed for
inverse PCR.

SEQ ID No.: 3

Primer 3: 5'-GGG CGA AGA CGC TAT ATT CCT GAG C-3'

SEQ ID No.: 4

Primer 4: 5'-GAA GCC TTA ATT CAA TCT GGG AAT AAT C-3'

Inverse PCR was performed essentially as described in Triglia T. et al. (1988) Nucleic Acids Res. 16, 8186. 5 µg
genomic DNA from Carboxydothermus hydrogenoformans were cleaved by EcoRI according to supplier's specifications
(Boehringer Mannheim GmbH) and treated with an equal volume of phenol/chloroform mixture. The aqueous phase
was removed, the DNA precipitated with ethanol and collected by centrifugation.

For circularization the digested DNA was diluted to a concentration of 50 ng/µl in ligation buffer (Boehringer
Mannheim GmbH, FRG). The ligation reaction was initiated by the addition of T4 DNA ligase (Boehringer Mannheim
GmbH, FRG) to a concentration of 0.2 units/µl and the reaction was allowed to proceed for 15 hrs at 15°C. The ligated
DNA was then precipitated with ethanol and collected by centrifugation.

The PCR was performed in 50 µl buffer containing 50 mM Tris-Cl, pH 9.2, 16 mM (NH4)2SO4, 2.25 mM MgCl2, 2 %
(v/v) DMSO, 0.1 % (v/v) Tween 20, 700 ng of circularized DNA obtained as described above, 50 pmoles of each
primer, 500 µM dNTP and 0.75 µl enzyme mix (Expand Long Template PCR System, Boehringer Mannheim GmbH).

The cycle conditions were as follows:

1 x    denaturation of template for 2 min. at 92°C

10 x   denaturation at 92°C for 10 sec.
       annealing at 64°C for 30 sec.
       elongation at 68°C for 2 min.

20 x   denaturation at 92°C for 10 sec.
       annealing at 64°C for 30 sec.
       elongation at 68°C for 2 min.
       + cycle elongation of 20 sec. for each cycle
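
The cycling program above can be written down as data to make the effect of the per-cycle elongation increment explicit. This is only a sketch under one assumption: that "cycle elongation of 20 sec. for each cycle" means the elongation step of the n-th of the 20 later cycles is lengthened by n x 20 sec. Ramp times between temperatures are ignored, and the function name `hold_seconds` is illustrative.

```python
def hold_seconds(initial_denaturation, blocks):
    """Sum the programmed hold times (in seconds) of a cycling program.

    Each block is (cycles, denaturation, annealing, elongation, extension),
    where `extension` is added to the elongation step once per elapsed cycle
    of that block (assumed interpretation of the per-cycle increment).
    """
    total = initial_denaturation
    for cycles, denature, anneal, elongate, extension in blocks:
        for n in range(1, cycles + 1):
            total += denature + anneal + elongate + extension * n
    return total


# (cycles, denaturation s, annealing s, elongation s, per-cycle extension s)
INVERSE_PCR = [
    (10, 10, 30, 120, 0),
    (20, 10, 30, 120, 20),
]

total = hold_seconds(120, INVERSE_PCR)
print(total, "s =", total / 60, "min")
```

Under this reading the last elongation step lasts 2 min. plus 400 sec., which shows how the increment favours completion of long products in late cycles.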



Agarose gel electrophoresis revealed a specifically amplified DNA fragment 7,000 base pairs long. The DNA
fragment was ligated into the pCR II vector (Invitrogen) and sequenced. Deduced from this sequence, primer 5 and 6
coding for the 5'- and 3'-ends, respectively, of the polymerase region could be designed. Primer 5 contained a NcoI site and
primer 6 contained a BamHI site.
The PCR was performed under the same conditions as described above (inverse PCR) using 750 ng genomic DNA
from Carboxydothermus hydrogenoformans as template.

SEQ ID No.: 5

Primer 5: 5'-CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT-3'

SEQ ID No.: 6

Primer 6: 5'-CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT-3'
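
The statement that primer 5 carries an NcoI site and primer 6 a BamHI site can be checked directly. The sketch below uses the primer sequences as printed in the sequence listing (SEQ ID NO: 5 and 6) and the standard recognition sequences CCATGG (NcoI) and GGATCC (BamHI); the variable and function names are illustrative.

```python
NCOI = "CCATGG"   # NcoI recognition sequence
BAMHI = "GGATCC"  # BamHI recognition sequence

# Primer sequences as given in the sequence listing (SEQ ID NO: 5 and 6).
PRIMER_5 = "CGAATTCAATCCATGGGAAAAGTAGTCCTGGTGGAT"
PRIMER_6 = "CGAATTCAAGGATCCTTACTTCGCTTCATACCAGTT"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# 0-based positions of the restriction sites within each primer.
print(PRIMER_5.find(NCOI))   # 10
print(PRIMER_6.find(BAMHI))  # 9

# The ATG inside the NcoI site starts the coding part of primer 5.
print(PRIMER_5[12:])                      # ATGGGAAAAGTAGTCCTGGTGGAT
# Primer 6 anneals to the opposite strand; downstream of the BamHI site
# its reverse complement reads as the last codons of the coding region.
print(reverse_complement(PRIMER_6[15:]))  # AACTGGTATGAAGCGAAGTAA
```

Both printed fragments agree with the first and last codons of SEQ ID NO: 7 (ATG GGA AAA ... and ... GCG AAG TAA).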
Cloning and expression.

The PCR product was purified by electrophoresis of 20 µl of the PCR mixture on a 0.8 % agarose gel. The 2.496
kb band of the polymerase coding region was purified from the agarose by phenol extraction. The DNA was then treated
with chloroform and precipitated with ethanol. The pellet was resuspended and digested with NcoI and BamHI
according to supplier's specifications (Boehringer Mannheim GmbH) to give cohesive ends for directional cloning. The DNA
was ligated into the expression vector pDS56 (Stüber D., Matile H. and Garotta G. (1990) Immunological Methods,
Lefkovits, I. and Pernis, B., eds.) that had also been digested with NcoI and BamHI. The ligated products were introduced
into E. coli strain BL21(DE3) pUBS520 (Brinkmann U. et al. (1989) Gene 85, 109-114) by transformation. Transformants
were grown on L-agar containing 100 µg/ml ampicillin and 50 µg/ml kanamycin to allow selection of recombinants.
Colonies were picked and grown in L-broth containing 100 µg/ml ampicillin and 50 µg/ml kanamycin, and plasmid DNA was
prepared by alkaline lysis. The plasmids were screened for insertions by digestion with BamHI. Those recombinants
containing inserts were grown in L-broth containing ampicillin and kanamycin and tested for the expression of
thermophilic DNA polymerase by induction of exponentially growing culture with 1 mM IPTG and assaying the heat-treated
extracts for DNA polymerase activity as described above (determination of DNA polymerase activity). A recombinant
expressing the DNA polymerase from Carboxydothermus hydrogenoformans was obtained. The strain was designated
E. coli AR96 (DSM No. 11179) and the plasmid pAR4.
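
As a consistency check on the cloned coding region, the length given in the sequence listing (2496 base pairs, SEQ ID NO: 7) can be compared with the length of the encoded protein (831 amino acids, SEQ ID NO: 8). The sketch below builds the standard genetic code and translates only the first and last codons printed in the listing; the helper `translate` is illustrative and not part of the patent.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1 to one-letter amino acids ('*' = stop)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


CDS_LENGTH = 2496               # base pairs, from SEQ ID NO: 7
codons = CDS_LENGTH // 3        # 832 codons including the stop codon
print(CDS_LENGTH % 3, codons - 1)  # 0 831

print(translate("ATG GGA AAA GTA GTC CTG GTG GAT"))  # MGKVVLVD
print(translate("AAC TGG TAT GAA GCG AAG TAA"))      # NWYEAK*
```

The 832 codons correspond to 831 residues plus the terminating TAA, in agreement with the two sequence records.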



SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: Boehringer Mannheim GmbH
(B) STREET: Sandhoferstr. 116
(C) CITY: Mannheim
(E) COUNTRY: DE
(F) POSTAL CODE (ZIP): 68305
(G) TELEPHONE: 06217595482
(H) TELEFAX: 06217594457

(ii) TITLE OF INVENTION: Thermostable DNA Polymerase from Carboxydothermus hydrogenoformans

(iii) NUMBER OF SEQUENCES: 8

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CCNAAYYTNC ARAAYATH

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

YTCRTCRTGN ACYTG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGGCGAAGAC GCTATATTCC TGAGC 




(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GAAGCCTTAA TTCAATCTGG GAATAATC

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CGAATTCAAT CCATGGGAAA AGTAGTCCTG GTGGAT

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CGAATTCAAG GATCCTTACT TCGCTTCATA CCAGTT

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2496 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2496

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATG GGA AAA GTA GTC CTG GTG GAT GGA AAT AGT TTA TTA CAT AGA 45
Met Gly Lys Val Val Leu Val Asp Gly Asn Ser Leu Leu His Arg
1 5 10 15

GCG TTT TTT GCC CTT CCG CCC TTA AAA ACT ACT AAA GGA GAG CCT 90
Ala Phe Phe Ala Leu Pro Pro Leu Lys Thr Thr Lys Gly Glu Pro
20 25 30

ACC GGG GCG GTT TAC GGG TTT TTA ACG ATG CTT TTT CGG GTA ATA 135
Thr Gly Ala Val Tyr Gly Phe Leu Thr Met Leu Phe Arg Val Ile
35 40 45



AAA GAT GAA AAA CCC GAA TAT TTA GCG GTA GCT TTT GAT ATT AGC 180
Lys Asp Glu Lys Pro Glu Tyr Leu Ala Val Ala Phe Asp Ile Ser
50 55 60

CGG AAA ACT TTT CGT ACC GAG CAG TTT ACT GCA TAC AAA GGG CAC 225
Arg Lys Thr Phe Arg Thr Glu Gln Phe Thr Ala Tyr Lys Gly His
65 70 75

CGC AAA GAA GCC CCG GAT GAG CTT GTA CCC CAG TTT GCC CTG GTG 270
Arg Lys Glu Ala Pro Asp Glu Leu Val Pro Gln Phe Ala Leu Val
80 85 90

CGG GAA GTA TTA AAG GTT TTA AAT GTT CCC TAT ATT GAA CTT GAC 315
Arg Glu Val Leu Lys Val Leu Asn Val Pro Tyr Ile Glu Leu Asp
95 100 105

GGT TAT GAG GCC GAT GAT ATT ATC GGC CAC CTA TCA AGG GCT TTT 360
Gly Tyr Glu Ala Asp Asp Ile Ile Gly His Leu Ser Arg Ala Phe
110 115 120

GCG GGA CAA GGA CAT GAA GTG GTG ATT TAT ACC GCT GAC CGG GAC 405
Ala Gly Gln Gly His Glu Val Val Ile Tyr Thr Ala Asp Arg Asp
125 130 135

ATG CTG CAA TTG GTA GAT GAA AAA ACG GTG GTA TAC CTT ACC AAA 450
Met Leu Gln Leu Val Asp Glu Lys Thr Val Val Tyr Leu Thr Lys
140 145 150

AAA GGC ATT ACC GAA CTG GTT AAA ATG GAT TTA GCT GCG ATT TTA 495
Lys Gly Ile Thr Glu Leu Val Lys Met Asp Leu Ala Ala Ile Leu
155 160 165

GAA AAC TAC GGC TTA AAG CCT AAA CAG CTT GTG GAT GTT AAA GGA 540
Glu Asn Tyr Gly Leu Lys Pro Lys Gln Leu Val Asp Val Lys Gly
170 175 180



TTA ATG GGA GAT CCC TCG GAC AAC ATA CCC GGG GTT CCC GGG ATT 585
Leu Met Gly Asp Pro Ser Asp Asn Ile Pro Gly Val Pro Gly Ile
185 190 195

GGG GAG AAA ACT GCT TTA GAT TTA ATT AAA ACT TAT GGC TCA GTG 630
Gly Glu Lys Thr Ala Leu Asp Leu Ile Lys Thr Tyr Gly Ser Val
200 205 210

GAA GAA GTT TTG GCC CGT AAA GAT GAG TTA AAA CCT AAA TTA AGA 675
Glu Glu Val Leu Ala Arg Lys Asp Glu Leu Lys Pro Lys Leu Arg
215 220 225

GAA AAG CTT GCC GAA CAC GAA AAT TTA GCA AAA ATA TCG AAA CAA 720
Glu Lys Leu Ala Glu His Glu Asn Leu Ala Lys Ile Ser Lys Gln
230 235 240

TTA GCT ACA ATC CTG CGG GAA ATA CCG TTA GAA ATC TCC CTG GAA 765
Leu Ala Thr Ile Leu Arg Glu Ile Pro Leu Glu Ile Ser Leu Glu
245 250 255

GAT TTA AAA GTT AAA GAA CCT AAT TAT GAA GAA GTT GCT AAA TTA 810
Asp Leu Lys Val Lys Glu Pro Asn Tyr Glu Glu Val Ala Lys Leu
260 265 270

TTT CTT CAC CTT GAG TTT AAA AGC TTT TTA AAA GAA ATA GAA CCA 855
Phe Leu His Leu Glu Phe Lys Ser Phe Leu Lys Glu Ile Glu Pro
275 280 285

AAA ATA AAG AAA GAA TAC CAG GAA GGT AAA GAT TTG GTG CAA GTT 900
Lys Ile Lys Lys Glu Tyr Gln Glu Gly Lys Asp Leu Val Gln Val
290 295 300



GAA ACT GTA GAA ACG GAA GGA CAG ATT GCA GTA GTT TTT AGT GAT 945
Glu Thr Val Glu Thr Glu Gly Gln Ile Ala Val Val Phe Ser Asp
305 310 315

GGA TTT TAT GTT GAT GAC GGG GAA AAA ACA AAG TTT TAC TCT TTA 990
Gly Phe Tyr Val Asp Asp Gly Glu Lys Thr Lys Phe Tyr Ser Leu
320 325 330

GAC CGG CTG AAT GAA ATA GAG GAA ATA TTT AGG AAT AAA AAA ATT 1035
Asp Arg Leu Asn Glu Ile Glu Glu Ile Phe Arg Asn Lys Lys Ile
335 340 345

ATT ACC GAC GAT GCC AAA GGA ATT TAT CAT GTC TGT TTA GAA AAA 1080
Ile Thr Asp Asp Ala Lys Gly Ile Tyr His Val Cys Leu Glu Lys
350 355 360

GGT CTG ACT TTT CCC GAA GTT TGT TTT GAT GCG CGG ATT GCA GCT 1125
Gly Leu Thr Phe Pro Glu Val Cys Phe Asp Ala Arg Ile Ala Ala
365 370 375

TAT GTT TTA AAC CCG GCC GAC CAA AAT CCC GGC CTC AAG GGG CTT 1170
Tyr Val Leu Asn Pro Ala Asp Gln Asn Pro Gly Leu Lys Gly Leu
380 385 390

TAT CTA AAG TAT GAC TTA CCG GTG TAT GAA GAT GTA TCT TTA AAC 1215
Tyr Leu Lys Tyr Asp Leu Pro Val Tyr Glu Asp Val Ser Leu Asn
395 400 405

ATT AGA GGG TTG TTT TAT TTA AAA AAA GAA ATG ATG AGA AAA ATC 1260
Ile Arg Gly Leu Phe Tyr Leu Lys Lys Glu Met Met Arg Lys Ile
410 415 420

TTT GAG CAG GAG CAA GAA AGG TTA TTT TAT GAA ATA GAA CTT CCT 1305
Phe Glu Gln Glu Gln Glu Arg Leu Phe Tyr Glu Ile Glu Leu Pro
425 430 435

TTA ACT CCA GTT CTT GCT CAA ATG GAG CAT ACC GGC ATT CAG GTT 1350
Leu Thr Pro Val Leu Ala Gln Met Glu His Thr Gly Ile Gln Val
440 445 450






EP 0 834 569 A1



GAA CTC GGT CGG AAA TTA CGC AAG ATG TTT ATA CCT TCA CCG GGG 1800
Glu Leu Gly Arg Lys Leu Arg Lys Met Phe Ile Pro Ser Pro Gly
590 595 600

TAT GAT TAT ATT GTT TCG GCG GAT TAT TCC CAG ATT GAA TTA AGG 1845
Tyr Asp Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg
605 610 615

CTT CTT GCG CAT TTT TCC GAA GAG CCC AAG CTT ATT GAA GCT TAC 1890
Leu Leu Ala His Phe Ser Glu Glu Pro Lys Leu Ile Glu Ala Tyr
620 625 630

CAA AAA GGG GAG GAT ATT CAC CGG AAA ACG GCC TCC GAG GTG TTC 1935
Gln Lys Gly Glu Asp Ile His Arg Lys Thr Ala Ser Glu Val Phe
635 640 645

GGT GTA TCT TTG GAA GAA GTT ACT CCC GAG ATG CGC GCT CAT GCC 1980
Gly Val Ser Leu Glu Glu Val Thr Pro Glu Met Arg Ala His Ala
650 655 660

AAG TCG GTG AAC TTC GGC ATT GTT TAT GGC ATT AGT GAT TTT GGT 2025
Lys Ser Val Asn Phe Gly Ile Val Tyr Gly Ile Ser Asp Phe Gly
665 670 675

TTA GGC AGA GAC TTA AAG ATT CCC CGG GAG GTT GCC GGT AAG TAC 2070
Leu Gly Arg Asp Leu Lys Ile Pro Arg Glu Val Ala Gly Lys Tyr
680 685 690

ATT AAA AAT TAT TTT GCC AAC TAT CCC AAA GTG CGG GAG TAT CTC 2115
Ile Lys Asn Tyr Phe Ala Asn Tyr Pro Lys Val Arg Glu Tyr Leu
695 700 705

GAT GAA CTT GTC CGT ACG GCA AGA GAA AAG GGA TAT GTG ACC ACT 2160
Asp Glu Leu Val Arg Thr Ala Arg Glu Lys Gly Tyr Val Thr Thr
710 715 720

TTA TTT GGG CGA AGA CGC TAT ATT CCT GAG CTA TCT TCA AAA AAC 2205
Leu Phe Gly Arg Arg Arg Tyr Ile Pro Glu Leu Ser Ser Lys Asn
725 730 735

CGC ACG GTT CAG GGT TTT GGC GAA AGG ACG GCC ATG AAT ACT CCC 2250
Arg Thr Val Gln Gly Phe Gly Glu Arg Thr Ala Met Asn Thr Pro
740 745 750

CTT CAG GGC TCG GCT GCC GAT ATT ATT AAG CTT GCA ATG ATT AAT 2295
Leu Gln Gly Ser Ala Ala Asp Ile Ile Lys Leu Ala Met Ile Asn
755 760 765

GTA GAA AAA GAA CTT AAA GCC CGT AAG CTT AAG TCC CGG CTC CTT 2340
Val Glu Lys Glu Leu Lys Ala Arg Lys Leu Lys Ser Arg Leu Leu
770 775 780

CTT TCG GTG CAC GAT GAG TTA GTT TTA GAA GTG CCG GCG GAA GAG 2385
Leu Ser Val His Asp Glu Leu Val Leu Glu Val Pro Ala Glu Glu
785 790 795

CTG GAA GAG GTA AAA GCG CTG GTA AAA GGG GTT ATG GAG TCG GTG 2430
Leu Glu Glu Val Lys Ala Leu Val Lys Gly Val Met Glu Ser Val
800 805 810

GTT GAA CTG AAA GTG CCT TTA ATC GCT GAA GTT GGT GCA GGC AAA 2475
Val Glu Leu Lys Val Pro Leu Ile Ala Glu Val Gly Ala Gly Lys
815 820 825

AAC TGG TAT GAA GCG AAG TAA 2496
Asn Trp Tyr Glu Ala Lys *
830

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Gly Lys Val Val Leu Val Asp Gly Asn Ser Leu Leu His Arg
1 5 10 15

Ala Phe Phe Ala Leu Pro Pro Leu Lys Thr Thr Lys Gly Glu Pro
20 25 30

Thr Gly Ala Val Tyr Gly Phe Leu Thr Met Leu Phe Arg Val Ile
35 40 45

Lys Asp Glu Lys Pro Glu Tyr Leu Ala Val Ala Phe Asp Ile Ser
50 55 60

Arg Lys Thr Phe Arg Thr Glu Gln Phe Thr Ala Tyr Lys Gly His
65 70 75

Arg Lys Glu Ala Pro Asp Glu Leu Val Pro Gln Phe Ala Leu Val
80 85 90

Arg Glu Val Leu Lys Val Leu Asn Val Pro Tyr Ile Glu Leu Asp
95 100 105

Gly Tyr Glu Ala Asp Asp Ile Ile Gly His Leu Ser Arg Ala Phe
110 115 120

Ala Gly Gln Gly His Glu Val Val Ile Tyr Thr Ala Asp Arg Asp
125 130 135

Met Leu Gln Leu Val Asp Glu Lys Thr Val Val Tyr Leu Thr Lys
140 145 150

Lys Gly Ile Thr Glu Leu Val Lys Met Asp Leu Ala Ala Ile Leu
155 160 165

Glu Asn Tyr Gly Leu Lys Pro Lys Gln Leu Val Asp Val Lys Gly
170 175 180

Leu Met Gly Asp Pro Ser Asp Asn Ile Pro Gly Val Pro Gly Ile
185 190 195

Gly Glu Lys Thr Ala Leu Asp Leu Ile Lys Thr Tyr Gly Ser Val
200 205 210

Glu Glu Val Leu Ala Arg Lys Asp Glu Leu Lys Pro Lys Leu Arg
215 220 225

Glu Lys Leu Ala Glu His Glu Asn Leu Ala Lys Ile Ser Lys Gln
230 235 240

Leu Ala Thr Ile Leu Arg Glu Ile Pro Leu Glu Ile Ser Leu Glu
245 250 255

Asp Leu Lys Val Lys Glu Pro Asn Tyr Glu Glu Val Ala Lys Leu
260 265 270

Phe Leu His Leu Glu Phe Lys Ser Phe Leu Lys Glu Ile Glu Pro
275 280 285

Lys Ile Lys Lys Glu Tyr Gln Glu Gly Lys Asp Leu Val Gln Val
290 295 300



Glu Thr Val Glu Thr Glu Gly Gln Ile Ala Val Val Phe Ser Asp
305 310 315

Gly Phe Tyr Val Asp Asp Gly Glu Lys Thr Lys Phe Tyr Ser Leu
320 325 330

Asp Arg Leu Asn Glu Ile Glu Glu Ile Phe Arg Asn Lys Lys Ile
335 340 345

Ile Thr Asp Asp Ala Lys Gly Ile Tyr His Val Cys Leu Glu Lys
350 355 360

Gly Leu Thr Phe Pro Glu Val Cys Phe Asp Ala Arg Ile Ala Ala
365 370 375

Tyr Val Leu Asn Pro Ala Asp Gln Asn Pro Gly Leu Lys Gly Leu
380 385 390

Tyr Leu Lys Tyr Asp Leu Pro Val Tyr Glu Asp Val Ser Leu Asn
395 400 405

Ile Arg Gly Leu Phe Tyr Leu Lys Lys Glu Met Met Arg Lys Ile
410 415 420

Phe Glu Gln Glu Gln Glu Arg Leu Phe Tyr Glu Ile Glu Leu Pro
425 430 435

Leu Thr Pro Val Leu Ala Gln Met Glu His Thr Gly Ile Gln Val
440 445 450

Asp Arg Glu Ala Leu Lys Glu Met Ser Leu Glu Leu Gly Glu Gln
455 460 465

Ile Glu Glu Leu Ile Arg Glu Ile Tyr Val Leu Ala Gly Glu Glu
470 475 480



Phe Asn Leu Asn Ser Pro Arg Gln Leu Gly Val Ile Leu Phe Glu
485 490 495

Lys Leu Gly Leu Pro Val Ile Lys Lys Thr Lys Thr Gly Tyr Ser
500 505 510

Thr Asp Ala Glu Val Leu Glu Glu Leu Leu Pro Phe His Glu Ile
515 520 525

Ile Gly Lys Ile Leu Asn Tyr Arg Gln Leu Met Lys Leu Lys Ser
530 535 540

Thr Tyr Thr Asp Gly Leu Met Pro Leu Ile Asn Glu Arg Thr Gly
545 550 555

Lys Leu His Thr Thr Phe Asn Gln Thr Gly Thr Leu Thr Gly Arg
560 565 570

Leu Ala Ser Ser Glu Pro Asn Leu Gln Asn Ile Pro Ile Arg Leu
575 580 585

Glu Leu Gly Arg Lys Leu Arg Lys Met Phe Ile Pro Ser Pro Gly
590 595 600

Tyr Asp Tyr Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu Arg
605 610 615

Leu Leu Ala His Phe Ser Glu Glu Pro Lys Leu Ile Glu Ala Tyr
620 625 630

Gln Lys Gly Glu Asp Ile His Arg Lys Thr Ala Ser Glu Val Phe
635 640 645

Gly Val Ser Leu Glu Glu Val Thr Pro Glu Met Arg Ala His Ala
650 655 660



Lys Ser Val Asn Phe Gly Ile Val Tyr Gly Ile Ser Asp Phe Gly
665 670 675

Leu Gly Arg Asp Leu Lys Ile Pro Arg Glu Val Ala Gly Lys Tyr
680 685 690

Ile Lys Asn Tyr Phe Ala Asn Tyr Pro Lys Val Arg Glu Tyr Leu
695 700 705

Asp Glu Leu Val Arg Thr Ala Arg Glu Lys Gly Tyr Val Thr Thr
710 715 720

Leu Phe Gly Arg Arg Arg Tyr Ile Pro Glu Leu Ser Ser Lys Asn
725 730 735

Arg Thr Val Gln Gly Phe Gly Glu Arg Thr Ala Met Asn Thr Pro
740 745 750

Leu Gln Gly Ser Ala Ala Asp Ile Ile Lys Leu Ala Met Ile Asn
755 760 765

Val Glu Lys Glu Leu Lys Ala Arg Lys Leu Lys Ser Arg Leu Leu
770 775 780

Leu Ser Val His Asp Glu Leu Val Leu Glu Val Pro Ala Glu Glu
785 790 795

Leu Glu Glu Val Lys Ala Leu Val Lys Gly Val Met Glu Ser Val
800 805 810

Val Glu Leu Lys Val Pro Leu Ile Ala Glu Val Gly Ala Gly Lys
815 820 825

Asn Trp Tyr Glu Ala Lys *
830



Claims 

1. A purified thermostable DNA polymerase obtainable from Carboxydothermus hydrogenoformans.

2. A purified thermostable DNA polymerase according to claim 1 that catalyses the template directed polymerisation
of DNA, possesses 3'-5' exonuclease activity and has reverse transcriptase activity in the presence of magnesium
ions.

3. A purified thermostable DNA polymerase as claimed in any one of claims 1-2 wherein said polymerase exhibits
reverse transcriptase activity in the presence of magnesium ions and in the substantial absence of manganese
ions.

4. The polymerase as claimed in any of claims 1-3, wherein said polymerase exhibits a reverse transcriptase activity
which is Mn2+ dependent.

5. The polymerase as claimed in any of claims 1-4, wherein the magnesium ions dependent reverse transcriptase
activity of said polymerase is higher than the DNA polymerase activity of said polymerase.

6. The polymerase as claimed in any of claims 1-5, wherein the magnesium ions dependent reverse transcriptase
activity of said polymerase is higher than the manganese dependent reverse transcriptase activity of said
polymerase.

7. The polymerase as claimed in any of claims 1-6, wherein said polymerase has an apparent molecular weight
between about 100 to 105 kDa as determined by SDS polyacrylamide electrophoresis.

8. The polymerase as claimed in any of claims 1-7, wherein said polymerase is obtainable from E. coli BL21 (DE3)
pUBS520, the strain being designated AR96.

9. An isolated DNA sequence coding for the polymerase as claimed in any one of claims 1-8 obtainable from
Carboxydothermus hydrogenoformans.

10. A recombinant DNA sequence capable of encoding polymerase activity of the microorganism Carboxydothermus
hydrogenoformans.

11. An isolated DNA sequence represented by the formula shown in SEQ ID No. 7.

12. A vector containing the isolated DNA sequence as claimed in any of claims 9-11.

13. The vector according to claim 12, wherein such vector is plasmid pDS56 carrying the Carboxydothermus
hydrogenoformans DNA polymerase gene and is then designated pAR4.

14. The vector according to claims 12 and 13 providing some or all of the following features:

(1) promoters or sites of initiation of transcription

(2) operators which could be used to turn gene expression on or off

(3) ribosome binding sites for improved translation

(4) transcription or translation termination sites

15. A microbial host transformed with the vector of claims 12-14.

16. A microbial host according to claim 15 wherein said transformant is from E. coli BL21 (DE3) pUBS520, the strain
being designated AR96.

17. A process for the preparation of DNA polymerase according to any of the claims 1-8 comprising the steps:

(a) culturing the natural strain Carboxydothermus hydrogenoformans

(b) suspending the cells of the natural strain in buffer

(c) disrupting the cells

(d) purifying the DNA polymerase by chromatographic steps including the use of one or more Sepharose columns.

18. A process for the preparation of DNA polymerase according to any one of claims 1-8 comprising growing a 
recombinant E. coli strain transformed with a vector according to claims 12-14 and purifying and isolating the DNA 
polymerase. 

19. A process of amplifying DNA, characterized in that a thermostable DNA polymerase as claimed in any one of 
claims 1-8 is used. 

20. A process for second cDNA cloning and DNA sequencing, characterized in that a thermostable DNA polymerase 
as claimed in any one of claims 1-8 is used. 

21. A process for DNA labeling, characterized in that a thermostable DNA polymerase as claimed in any one of claims 
1-8 is used. 

22. A process for reverse transcription, characterized in that a thermostable DNA polymerase as claimed in any one of 
claims 1-8 is used. 
Figure 1: 
Figure 2: 

1 E. coli pol I (20 U) + Klenow fragment (10 U) 
2 Cell extract from C. hydrogenoformans (2 
3 Chy polymerase, rec. (7 U) 
4 Taq polymerase (20 U) 
5 Cell extract from C. hydrogenoformans (2 µl) 
6 Chy polymerase, rec. (14 U) 
7 E. coli pol I (10 U) + Klenow fragment (5 U) 
8 Cell extract from C. hydrogenoformans (4 µl) 
9 Chy polymerase, rec. (70 U) 
10 Taq polymerase (20 U) 
Figure 5: 

SEQ ID No.: 7 

ATG GGA AAA GTA GTC CTG GTG GAT GGA AAT AGT TTA TTA CAT AGA GCG  48
M   G   K   V   V   L   V   D   G   N   S   L   L   H   R   A    16

TTT TTT GCC CTT CCG CCC TTA AAA ACT ACT AAA GGA GAG CCT ACC GGG  96
F   F   A   L   P   P   L   K   T   T   K   G   E   P   T   G    32

GCG GTT TAC GGG TTT TTA ACG ATG CTT TTT CGG GTA ATA AAA GAT GAA  144
A   V   Y   G   F   L   T   M   L   F   R   V   I   K   D   E    48

AAA CCC GAA TAT TTA GCG GTA GCT TTT GAT ATT AGC CGG AAA ACT TTT  192
K   P   E   Y   L   A   V   A   F   D   I   S   R   K   T   F    64

CGT ACC GAG CAG TTT ACT GCA TAC AAA GGG CAC CGC AAA GAA GCC CCG  240
R   T   E   Q   F   T   A   Y   K   G   H   R   K   E   A   P    80

GAT GAG CTT GTA CCC CAG TTT GCC CTG GTG CGG GAA GTA TTA AAG GTT  288
D   E   L   V   P   Q   F   A   L   V   R   E   V   L   K   V    96

TTA AAT GTT CCC TAT ATT GAA CTT GAC GGT TAT GAG GCC GAT GAT ATT  336
L   N   V   P   Y   I   E   L   D   G   Y   E   A   D   D   I    112

ATC GGC CAC CTA TCA AGG GCT TTT GCG GGA CAA GGA CAT GAA GTG GTG  384
I   G   H   L   S   R   A   F   A   G   Q   G   H   E   V   V    128

ATT TAT ACC GCT GAC CGG GAC ATG CTG CAA TTG GTA GAT GAA AAA ACG  432
I   Y   T   A   D   R   D   M   L   Q   L   V   D   E   K   T    144

GTG GTA TAC CTT ACC AAA AAA GGC ATT ACC GAA CTG GTT AAA ATG GAT  480
V   V   Y   L   T   K   K   G   I   T   E   L   V   K   M   D    160

TTA GCT GCG ATT TTA GAA AAC TAC GGC TTA AAG CCT AAA CAG CTT GTG  528
L   A   A   I   L   E   N   Y   G   L   K   P   K   Q   L   V    176

GAT GTT AAA GGA TTA ATG GGA GAT CCC TCG GAC AAC ATA CCC GGG GTT  576
D   V   K   G   L   M   G   D   P   S   D   N   I   P   G   V    192

CCC GGG ATT GGG GAG AAA ACT GCT TTA GAT TTA ATT AAA ACT TAT GGC  624
P   G   I   G   E   K   T   A   L   D   L   I   K   T   Y   G    208

TCA GTG GAA GAA GTT TTG GCC CGT AAA GAT GAG TTA AAA CCT AAA TTA  672
S   V   E   E   V   L   A   R   K   D   E   L   K   P   K   L    224

AGA GAA AAG CTT GCC GAA CAC GAA AAT TTA GCA AAA ATA TCG AAA CAA  720
R   E   K   L   A   E   H   E   N   L   A   K   I   S   K   Q    240

TTA GCT ACA ATC CTG CGG GAA ATA CCG TTA GAA ATC TCC CTG GAA GAT  768
L   A   T   I   L   R   E   I   P   L   E   I   S   L   E   D    256

TTA AAA GTT AAA GAA CCT AAT TAT GAA GAA GTT GCT AAA TTA TTT CTT  816
L   K   V   K   E   P   N   Y   E   E   V   A   K   L   F   L    272

CAC CTT GAG TTT AAA AGC TTT TTA AAA GAA ATA GAA CCA AAA ATA AAG  864
H   L   E   F   K   S   F   L   K   E   I   E   P   K   I   K    288

AAA GAA TAC CAG GAA GGT AAA GAT TTG GTG CAA GTT GAA ACT GTA GAA  912
K   E   Y   Q   E   G   K   D   L   V   Q   V   E   T   V   E    304

ACG GAA GGA CAG ATT GCA GTA GTT TTT AGT GAT GGA TTT TAT GTT GAT  960
T   E   G   Q   I   A   V   V   F   S   D   G   F   Y   V   D    320

GAC GGG GAA AAA ACA AAG TTT TAC TCT TTA GAC CGG CTG AAT GAA ATA  1008
D   G   E   K   T   K   F   Y   S   L   D   R   L   N   E   I    336

GAG GAA ATA TTT AGG AAT AAA AAA ATT ATT ACC GAC GAT GCC AAA GGA  1056
E   E   I   F   R   K   K   K   I   I   T   D   D   A   K   G    352

ATT TAT CAT GTC TGT TTA GAA AAA GGT CTG ACT TTT CCC GAA GTT TGT  1104
I   Y   H   V   C   L   E   K   G   L   T   F   P   E   V   C    368

TTT GAT GCG CGG ATT GCA GCT TAT GTT TTA AAC CCG GCC GAC CAA AAT  1152
F   D   A   R   I   A   A   Y   V   L   N   P   A   D   Q   N    384

CCC GGC CTC AAG GGG CTT TAT CTA AAG TAT GAC TTA CCG GTG TAT GAA  1200
P   G   L   K   G   L   Y   L   K   Y   D   L   P   V   Y   E    400

GAT GTA TCT TTA AAC ATT AGA GGG TTG TTT TAT TTA AAA AAA GAA ATG  1248
D   V   S   L   N   I   R   G   L   F   Y   L   K   K   E   M    416

ATG AGA AAA ATC TTT GAG CAG GAG CAA GAA AGG TTA TTT TAT GAA ATA  1296
M   R   K   I   F   E   Q   E   Q   E   R   L   F   Y   E   I    432

GAA CTT CCT TTA ACT CCA GTT CTT GCT CAA ATG GAG CAT ACC GGC ATT  1344
E   L   P   L   T   P   V   L   A   Q   M   E   H   T   G   I    448

CAG GTT GAC CGG GAA GCT TTA AAA GAG ATG TCG TTA GAG CTG GGA GAG  1392
Q   V   D   R   E   A   L   K   E   M   S   L   E   L   G   E    464

CAA ATT GAA GAG TTA ATC CGG GAA ATT TAT GTG CTG GCG GGG GAA GAG  1440
Q   I   E   E   L   I   R   E   I   Y   V   L   A   G   E   E    480

TTT AAC TTA AAC TCG CCC AGG CAG CTG GGA GTT ATT CTT TTT GAA AAA  1488
F   N   L   N   S   P   R   Q   L   G   V   I   L   F   E   K    496

CTT GGG CTG CCG GTA ATT AAA AAG ACC AAA ACG GGC TAC TCT ACC GAT  1536
L   G   L   P   V   I   K   K   T   K   T   G   Y   S   T   D    512

GCG GAG GTT TTG GAA GAG CTC TTG CCT TTC CAC GAA ATT ATC GGC AAA  1584
A   E   V   L   E   E   L   L   P   F   H   E   I   I   G   K    528

ATA TTG AAT TAC CGG CAG CTT ATG AAG TTA AAA TCC ACT TAT ACT GAC  1632
I   L   N   Y   R   Q   L   M   K   L   K   S   T   Y   T   D    544

GGC TTA ATG CCT TTA ATA AAT GAG CGT ACC GGT AAA CTT CAC ACT ACT  1680
G   L   M   P   L   I   N   E   R   T   G   K   L   H   T   T    560

TTT AAC CAG ACC GGT ACT TTA ACC GGA CGC CTG GCG TCT TCG GAG CCC  1728
F   N   Q   T   G   T   L   T   G   R   L   A   S   S   E   P    576

AAT CTC CAA AAT ATT CCC ATC CGG TTG GAA CTC GGT CGG AAA TTA CGC  1776
N   L   Q   N   I   P   I   R   L   E   L   G   R   K   L   R    592

AAG ATG TTT ATA CCT TCA CCG GGG TAT GAT TAT ATT GTT TCG GCG GAT  1824
K   M   F   I   P   S   P   G   Y   D   Y   I   V   S   A   D    608

TAT TCC CAG ATT GAA TTA AGG CTT CTT GCC CAT TTT TCC GAA GAG CCC  1872
Y   S   Q   I   E   L   R   L   L   A   H   F   S   E   E   P    624

AAG CTT ATT GAA GCT TAC CAA AAA GGG GAG GAT ATT CAC CGG AAA ACG  1920
K   L   I   E   A   Y   Q   K   G   E   D   I   H   R   K   T    640

GCC TCC GAG GTG TTC GGT GTA TCT TTG GAA GAA GTT ACT CCC GAG ATG  1968
A   S   E   V   F   G   V   S   L   E   E   V   T   P   E   M    656

CGC GCT CAT GCC AAG TCG GTG AAC TTC GGC ATT GTT TAT GGC ATT AGT  2016
R   A   H   A   K   S   V   N   F   G   I   V   Y   G   I   S    672

GAT TTT GGT TTA GGC AGA GAC TTA AAG ATT CCC CGG GAG GTT GCC GGT  2064
D   F   G   L   G   R   D   L   K   I   P   R   E   V   A   G    688

AAG TAC ATT AAA AAT TAT TTT GCC AAC TAT CCC AAA GTG CGG GAG TAT  2112
K   Y   I   K   N   Y   F   A   N   Y   P   K   V   R   E   Y    704

CTC GAT GAA CTT GTC CGT ACG GCA AGA GAA AAG GGA TAT GTG ACC ACT  2160
L   D   E   L   V   R   T   A   R   E   K   G   Y   V   T   T    720

TTA TTT GGG CGA AGA CGC TAT ATT CCT GAG CTA TCT TCA AAA AAC CGC  2208
L   F   G   R   R   R   Y   I   P   E   L   S   S   K   N   R    736

ACG GTT CAG GGT TTT GGC GAA AGG ACG GCC ATG AAT ACT CCC CTT CAG  2256
T   V   Q   G   F   G   E   R   T   A   M   N   T   P   L   Q    752

GGC TCG GCT GCC GAT ATT ATT AAG CTT GCA ATG ATT AAT GTA GAA AAA  2304
G   S   A   A   D   I   I   K   L   A   M   I   N   V   E   K    768

GAA CTT AAA GCC CGT AAG CTT AAG TCC CGG CTC CTT CTT TCG GTG CAC  2352
E   L   K   A   R   K   L   K   S   R   L   L   L   S   V   H    784

GAT GAG TTA GTT TTA GAA GTG CCG GCG GAA GAG CTG GAA GAG GTA AAA  2400
D   E   L   V   L   E   V   P   A   E   E   L   E   E   V   K    800

GCG CTG GTA AAA GGG GTT ATG GAG TCG GTG GTT GAA CTG AAA GTG CCT  2448
A   L   V   K   G   V   M   E   S   V   V   E   L   K   V   P    816

TTA ATC GCT GAA GTT GGT GCA GGC AAA AAC TGG TAT GAA GCG AAG TAA  2496
L   I   A   E   V   G   A   G   K   N   W   Y   E   A   K   *    831
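The listing above pairs each codon with a one-letter amino acid, so it can be checked mechanically. The following is a minimal sketch, not part of the original disclosure, assuming the standard genetic code; the function names `translate` and `mismatches` are hypothetical helpers introduced only for illustration. It translates the first and last rows of SEQ ID No. 7 and shows how a codon that disagrees with the printed residue line could be flagged.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the listing uses the standard code).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) codon by codon.

    Codons that are not valid triplets of A, C, G, T are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3))


def mismatches(dna: str, printed: str):
    """List (1-based codon index, codon, translated, printed) where they differ."""
    seq = "".join(dna.split()).upper()
    residues = "".join(printed.split())
    out = []
    for i, (got, want) in enumerate(zip(translate(seq), residues)):
        if got != want:
            out.append((i + 1, seq[3 * i:3 * i + 3], got, want))
    return out


# First and last rows of SEQ ID No. 7 as given in the listing.
FIRST_ROW = "ATG GGA AAA GTA GTC CTG GTG GAT GGA AAT AGT TTA TTA CAT AGA GCG"
LAST_ROW = "AAA AAC TGG TAT GAA GCG AAG TAA"

print(translate(FIRST_ROW))  # residues 1-16
print(translate(LAST_ROW))   # final residues followed by the stop codon

# 2496 nucleotides = 832 codons = 831 residues plus one stop codon.
print(2496 // 3 - 1)

# Deliberately corrupted example (TAG in place of TAC) to show the check.
print(mismatches("GCG GTT TAG GGG", "AVYG"))
```

Run over every row, a check of this kind points at codons whose translation disagrees with the printed residue, which is useful when the listing has passed through character recognition.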